Port scan detection using destination profiles

ABSTRACT

A method, including identifying, in network data traffic, a set of pairs of source and destination nodes, each pair having a given source node, a given destination node, and one or more ports accessed in the traffic between the nodes in each pair, and computing, for each pair, a respective baseline that indicates a first number of the ports that source nodes other than the given source node in the pair accessed on the given destination node during a first period. For each pair, a respective test score is computed that indicates a difference between a second number of the ports that the given source node in the pair accessed on the given destination node during a second period and the baseline, and a preventive action is initiated with respect to the given source node in any of the pairs for which the test score is greater than a threshold.

FIELD OF THE INVENTION

The present invention relates generally to computer security, and particularly to detecting port scan attacks.

BACKGROUND OF THE INVENTION

In computer networking, a communication port is a logical communication endpoint on the network that, from a software standpoint, identifies a specific resource (e.g., a process or a type of service) executing on a given computer in the network. Communication ports (also referred to herein simply as ports or port numbers) are typically defined by a communications protocol. For example, ports are one of the Layer 4 (i.e., the Transport Layer) protocols in the Open Systems Interconnection (OSI) model, and are used to define network sessions in client-server application architectures.

Ports provide a multiplexing service for multiple services or multiple communication sessions at one network address. In operation, ports are part of the addressing information used to identify sources and destinations of messages transmitted over a network. Additionally, each “open” port is typically associated with a specific service such as have a service that is connected to them such as a database service, an email service or a communication service.

Network port scanning is a method for determining which ports on a network are open. Running a port scan on a network or server reveals which ports are open and configured to receive and/or send information. Network professionals can use port scanning tools to measure their exposure to attackers and to monitor devices and services. Hackers, on the other hand, scan ports to probe networks for open ports that may be exploitable and to map which services run on each device. For example, a hacker can send a message to multiple ports, and analyze the responses from each given port in order to determine if the port is being used, and if so, what service is using the given port.

Documents incorporated by reference in the present patent application are to be considered an integral part of the application except that to the extent any terms are defined in these incorporated documents in a manner that conflicts with the definitions made explicitly or implicitly in the present specification, only the definitions in the present specification should be considered.

The description above is presented as a general overview of related art in this field and should not be construed as an admission that any of the information it contains constitutes prior art against the present patent application.

SUMMARY OF THE INVENTION

There is provided, in accordance with an embodiment of the present invention, a method including identifying, in data traffic transmitted between multiple nodes that communicate over a network, a set of pairs of source and destination nodes, each pair consisting of a given source node and a given destination node, and one or more communication ports accessed in the data traffic between the source and destination nodes in each pair, computing, for each pair in the set, a respective baseline level that is indicative of a first number of the communication ports that source nodes other than the given source node in the pair accessed on the given destination node during a first time period, computing, for each pair in the set, a respective test score that is indicative of a difference between a second number of the communication ports that the given source node in the pair accessed on the given destination node during a second time period and the baseline level, and initiating a preventive action with respect to the given source node in any of the pairs for which the test score is greater than a specified level.

In some embodiments, identifying the pairs of source and destination nodes includes collecting, from a probe, data packets transmitted over the network, aggregating the collected data packets into communication sessions between a given source node and a given destination node, and identifying, in each the aggregated communication sessions, the given source node accessing at least one given communication port on the given destination node. In additional embodiments, the method also includes identifying one or more noisy scanners, each of the noisy scanners including a given source node that accesses at least a specified first number of destination ports on at least a second specified number of destination nodes. In one embodiment, computing the baseline level for a given pair includes computing a revised first number of the protocol ports that source nodes other than the given source node in the pair and other than any of the identified noisy scanners accessed on the given destination node during the first time period;

In further embodiments, the specified level includes a first specified level, initiating the preventive action includes initiating a first preventive action, and the method also includes identifying any failed connections in the port scans during the second time period, and initiating a second preventive action with respect to the given source node in any of the pairs having at least one of the identified failed connections and for which the test score is greater than a second specified level lower than the first specified level.

In one embodiment, the second time period is subsequent to the first time period. In another embodiment, the first and the second time periods have substantially identical time durations. In supplemental embodiments, the first time period includes multiple sub-periods, and computing the test score for a given pair includes computing separate baseline levels for each of the sub-periods, computing an average of the separate baseline levels, and subtracting the computed average from the second number of the communication ports. In some embodiments, the second time period and each of the sub-periods have substantially identical time durations.

In additional embodiments, initiating the preventive action includes generating an alert for the given source node. In further embodiments, initiating the preventive action includes restricting access of the given source node to the network.

There is additionally provided, in accordance with an embodiment of the present invention, an apparatus including a network interface device coupled to a data network including multiple nodes that communicate via the network, and at least one processor configured to identify, in data traffic transmitted between the multiple nodes over the network, a set of pairs of source and destination nodes, each pair consisting of a given source node and a given destination node, and one or more communication ports accessed in the data traffic between the source and destination nodes in each pair, to compute, for each pair in the set, a respective baseline level that is indicative of a first number of the communication ports that source nodes other than the given source node in the pair accessed on the given destination node during a first time period, to compute, for each pair in the set, a respective test score that is indicative of a difference between a second number of the communication ports that the given source node in the pair accessed on the given destination node during a second time period and the baseline level, and to initiate a preventive action with respect to the given source node in any of the pairs for which the test score is greater than a specified level.

There is additionally provided, in accordance with an embodiment of the present invention, a computer software product, the product including a non-transitory computer-readable medium, in which program instructions are stored, which instructions, when read by a computer, cause the computer to identify, in data traffic transmitted between multiple nodes that communicate over a network, a set of pairs of source and destination nodes, each pair consisting of a given source node and a given destination node, and one or more communication ports accessed in the data traffic between the source and destination nodes in each pair, to compute, for each pair in the set, a respective baseline level that is indicative of a first number of the communication ports that source nodes other than the given source node in the pair accessed on the given destination node during a first time period, to compute, for each pair in the set, a respective test score that is indicative of a difference between a second number of the communication ports that the given source node in the pair accessed on the given destination node during a second time period and the baseline level, and to initiate a preventive action with respect to the given source node in any of the pairs for which the test score is greater than a specified level.

BRIEF DESCRIPTION OF THE DRAWINGS

The disclosure is herein described, by way of example only, with reference to the accompanying drawings, wherein:

FIG. 1 is a block diagram that schematically shows a computing facility comprising a system configured to detect port scans suspected of being malicious, in accordance with an embodiment of the present invention;

FIG. 2 is a flow diagram that schematically illustrates a method of identifying, in data packets transmitted from source nodes to destination nodes over the network, suspicious port scans, in accordance with an embodiment of the present invention;

FIG. 3 is a flow diagram that schematically illustrates a method of generating a destination a profile score that can be used to detect port scans, in accordance with an embodiment of the present invention;

FIG. 4 is a flow diagram that schematically illustrates a method of generating a source profile that can be used to detect and whitelist aggressive and periodic scanners, in accordance with an embodiment of the present invention;

FIG. 5 is a flow diagram that schematically illustrates a method of identifying malicious port scans comprising port scans for different software systems in a single category, in accordance with an embodiment of the present invention;

FIG. 6 is a flow diagram that schematically illustrates a method of identifying malicious port scans comprising outlier pairs of scanned ports, in accordance with an embodiment of the present invention; and

FIG. 7 is a flow diagram that schematically illustrates a method of identifying and whitelisting scanner probes, in accordance with an embodiment of the present invention.

DETAILED DESCRIPTION OF EMBODIMENTS Overview

Embodiments of the present invention provide methods and systems for identifying port scans on a data network. As described hereinbelow, while monitoring data traffic transmitted between multiple nodes that communicate over a network, a set of pairs of source and destination nodes are identified, each pair consisting of a given source node and a given destination node, and one or more communication ports accessed in the data traffic between the source and destination nodes in each pair. For each pair in the set, a respective baseline level and a respective test score are computed. For each pair in the set, the respective baseline level is indicative of a first number of the communication ports that source nodes other than the given source node in the pair accessed on the given destination node a first time period, and the respective test score that is indicative of a difference between a second number of the communication ports that the given source node in the pair accessed on the given destination node during a second time period and the baseline level. A preventive action can be initiated with respect to the given source node in any of the pairs for which the test score is greater than a specified level.

Embodiments of the present invention also provide methods and systems for detecting if any of the identified port scans comprise an anomalous combination of ports that can indicate a malicious port scan. Examples of anomalous combination of ports include, but are not limited to, port pairs and port groups. As described hereinbelow, the analysis to detect the suspicious port scans may be based on source profiles, port profiles, port pair profiles and scanner probe profiles.

System Description

FIG. 1 is a block diagram that schematically shows a computing facility 20 comprising a malicious port scan detection system 22 that collects and monitors data packets 24 transmitted between multiple nodes 26 coupled to a data network 28 in order to identify malicious port scans, in accordance with an embodiment of the present invention. In embodiments described herein, each node 26 comprises any type of device (i.e., physical or virtual) that is configured to communicate over the network, and has an IP address assigned for this purpose. In the example shown in FIG. 1, the nodes comprise workstations 26 and a public network 30 such as the Internet. As described hereinbelow, embodiments of the present invention aggregate the data packets into communication sessions, identify any of the communication sessions that comprise port scans 32, and generate an alert for any of the port scans that are suspected of being malicious.

While the example shown in FIG. 1 shows the nodes comprising workstations 26, nodes 26 comprising other types of devices that communicate over network 28 and Internet 30 are considered to be within the spirit and scope of the present invention. For example, the nodes may comprise devices such as servers, wireless devices such as smartphones, routers and network switches.

Each workstation 26 may comprise, for example, a workstation identifier (ID) 34, a workstation processor 36, a workstation memory 38 that stores a plurality of communication ports 40 (also referred to herein simply as ports). Unlike physical ports, ports 40 are logical entities that are defined by a communications protocol such as TCP/IP.

Examples of workstation IDs 34 include, but are not limited to, a media access control (MAC) addresses and Internet Protocol (IP) addresses that can be used to uniquely identify each of the workstations. While any given time, each given workstation 26 is assigned a unique IP address, the given workstation may be associated with multiple IP addresses over an extended time period. For example, the IP address for a given workstation 26 may change after a reboot of the given workstation. Generally, in operation, processor 36 executes, from memory 38, an operating system 42 (e.g., Linux) and one or more software applications 44 (e.g., a database server).

In the configuration shown in FIG. 1, memory 38 also stores a whitelist 80 that stores the identifiers for one or more workstations 26. As described in the description referencing FIGS. 4 and 7 hereinbelow, embodiments of the present invention can ignore any suspicious port scan 32 that is initiated by any workstation 26 in the whitelist.

Workstations 26 communicate over data network 28 (e.g., a local area network) that is also coupled to an Internet gateway 46. Gateway 46 couples computing facility 20 to public networks 30 such as the Internet, and comprises communications circuitry (not shown) that enables communication between workstations 26 and sites/computers (not shown) on the Internet.

In some embodiments, malicious port scan detection system 22 comprises a system processor 48 and a system memory 50, which are coupled by a system bus (not shown) to a network interface controller (NIC) 52 that couples the computer system to network 28. In some embodiments, malicious port scan detection system 22 may comprise a user interface (UI) device 54 (e.g., an LED display) or another type of output interface.

In the configuration shown in FIG. 1, malicious port scan detection system 22 comprises a probe 56 that collects information on data packets 24 transmitted over network 28. While the example in FIG. 1 shows probe 56 as a module of malicious port scan detection system 22, the probe can be implemented either as a standalone device coupled to network 28, or as a module in another device coupled to the network. Probe optionally collects data packets 24 from network 28 and processes the collected data packets to extract information, using any of the methods described, in U.S. Patent Application 2014/0165207 to Engel et al. and U.S. Patent Application 2015/0358344 to Mumcuoglu et al., whose disclosures are incorporated herein by reference.

Memory 50 stores respective pluralities of communication sessions 68, aggregated communication sessions 58 and port lists 60. In embodiments described herein, processor 48 is configured to collect the data packets from probe 56, to group the data packets into communication sessions 68, to aggregate the communication sessions into aggregated communication sessions 58, and to identify any of the aggregated communication sessions that indicate a given port scan 32. The use of port lists 60, which store respective pluralities of ports 40 (i.e., port numbers), is described in the description referencing FIG. 5, hereinbelow.

In the configuration shown in FIG. 1, memory 50 also stores a whitelist 80 that stores the identifiers for one or more workstations 26. As described in the description referencing FIGS. 4 and 7 hereinbelow, embodiments of the present invention can ignore any suspicious port scan 32 that is initiated by any workstation 26 in the whitelist.

Each communication session 68 optionally comprises a source node identifier 64, a destination port identifier 66, a time 84, a source port identifier 70, a destination port identifier 72, a protocol 74, a status 76, a volume 88 (source to destination), a reverse-volume 78 (also referred to as rvolume, destination to source), and a time 84. Each aggregated communication session 58 optionally comprises a port scan time period 62, a subset 86 of the communication sessions, and a signature 82.

In each given communication session 68, source node 64 stores the identifier of a first given workstation 26, destination node 66 stores the identifier of a second given workstation 26, source port 70 refers to a given port 40 on the first given workstation that is being used to communicate with the second given workstation during the given communication session, the destination port 72 refers to a given port 40 on the second given workstation that is being accessed during the given communication session, the protocol 74 refers to a given communications protocol (e.g., NFS, SSH, KERBEROS, LDAP) that is used by the given communication session, the status 76 indicates whether the given communication session completed successfully, volume 88 indicates an amount of data transmitted from the first given workstation to the second given workstation during the given communication session, and reverse volume 78 indicates an amount of data transmitted from the second given workstation to the first given workstation during the given communication session.

In embodiments described herein, source node 64 may be used to refer to the first given workstation, and destination node 66 may be used to refer to the second given workstation. In embodiments where workstations communicate using TCP/IP, processor can identify the source and the destination ports for a given communication session 68 based on information stored in a given data packet 24 storing the TCP header.

For each aggregated communication session 58, the port scan time period 62 comprise specified time period (e.g., a specific number of hours or days), and subset 86 refers to a plurality of communication sessions 68. Signatures 82 are described in the description referencing FIG. 7, hereinbelow.

In some embodiments, the tasks of collecting the data packets, grouping the data packets into the communication sessions, aggregating the communication sessions and identifying the aggregated communication sessions that comprise port scans 32 may be split among multiple devices within computing facility (e.g., workstations 26) or external to the computing facility (e.g., a data cloud based application). In some embodiments, the functionality of some or all of workstations 26 and/or malicious port scan detection system 22 may be deployed in computing facility 20 as virtual machines.

Examples of memories 38 and 50 include dynamic random-access memories and non-volatile random-access memories. In some embodiments, the memories may comprise non-volatile storage devices such as hard disk drives and solid-state disk drives.

Processors 36 and 48 comprise general-purpose central processing units (CPU) or special-purpose embedded processors, which are programmed in software or firmware to carry out the functions described herein. This software may be downloaded to computers 22 and 26 in electronic form, over a network, for example. Additionally or alternatively, the software may be stored on tangible, non-transitory computer-readable media, such as optical, magnetic, or electronic memory media. Further additionally or alternatively, at least some of the functions of processors 36 and 48 may be carried out by hard-wired or programmable digital logic circuits.

Port Scan Collection

FIG. 2 is a flow diagram that schematically illustrates a method for identifying suspicious port scans 32 on network 28, in accordance with an embodiment of the present invention. In embodiments described herein, a suspicious port scan comprises a source workstation 26 that accesses an anomalous combination of communication ports 40 on a destination workstation 26 within a predetermined time period.

In step 90, processor 48 uses probe 56 to collect data packets 24 that are transmitted between nodes 26 on network 28 during a time period that comprises multiple sub-periods. For example, the time period may comprise seven consecutive days (i.e., one week), and each sub-period may comprise any 24 hour period (e.g., one day) during the week.

In step 92, processor 48 groups and stores the collected data packets as individual communication sessions 68 between respective pairs of source and destination nodes 26. The communication session typically comprises a sequence of data packets 24 that a first given workstation 26 transmits to a given port 40 on a second given workstation 26. Upon detecting a given sequence of data packets, processor 48 defines a new communication session 68, and stores, to the new communication session, the identifier for the first given workstation to source node 64, the identifier for the second given workstation to destination node 66, the date and time that the given sequence of data packets were collected to time 84, the port number for the first given workstation in the TCP header to source port 70, the port for the second given workstation in the TCP header to destination port 72, a communications protocol used by the sequence of data packets to protocol 74, a status (e.g., succeeded/failed) of the communication session to status 76, and a first amount of data (e.g., 600 bytes) that the first given workstation transmitted to the second given workstation in the sequence of data packets to volume 88.

In some instances, the sequence of data packets may also comprise a second volume of data (e.g., 200 bytes) that the second given workstation transmits to the first given workstation. Process 48 can store the second amount of data to rvolume 78.

In some embodiments, processor 48 can group the packets according to the IP addresses (not shown) in the packets, such that the system processor can group together packets 24 having the same source and destination addresses or having the same source address, source port, destination address, destination port and protocol. In an alternative embodiment, processor 48 can manage a table (not shown) which correlates between addresses in packets and respective IDs 34 of nodes 26, for example as described in U.S. Patent Application 2016/0234167, which is incorporated herein by reference, and groups together packets according to the IDs corresponding to the addresses in the packets. An example for grouping the collected data packets is described in U.S. patent application Ser. No. 15/950,234, filed Apr. 11, 2018, which is incorporated herein by reference.

In step 94, processor 48 aggregates the communication sessions into a plurality of aggregated communication sessions 58, so that each of the aggregated communication sessions comprises the data in the communication sessions for each unique pair of source and destination nodes that communicated with each other during a given sub-period. In embodiments of the present invention, each sub-period typically comprises a predefined time period (e.g., one hour, two hours or 24 hours).

When aggregating communication sessions 68, processor 48 can identify and flag any of the communication sessions to a given port 40 that failed. In embodiments herein, these flagged communication sessions may be referred to as failed connections. A communication session to a given port 40 can be flagged as a failed connection if no response is received from the given port, or if a response is received indicating that the given port is closed. A failed connection is typically a result of a faulty configuration of a given node 26, and a given port 40 can be identified as a failed port by detecting that there are no successful connections to the given port on the given node. For example, if given node 26 comprises an email server that is configured with a wrong IP address, other nodes 26 on the network will generate failed connections when they attempt to access a wrong destination port on the email server.

In the TCP/IP communications model, a successful communication session comprises (a) a given source node 64 transmitting a “SYN” command to a given destination node 66, (b) the given destination node transmitting a “SYN-ACK” command to the given source node in response to receiving the “SYN” command, and (c) the given source node transmits an “ACK” command to the given destination node in response to receiving the “SYN-ACK” command. In embodiments of the present invention, processor 48 can identify a failed connection by detecting a given communication session 68 that is missing a “SYN-ACK” command transmitted from a given destination node 66 to a given source node 64 and/or is missing an “ACK” command transmitted from the given source node to the given destination node.

In embodiments of the present invention, processor 48 can use failed connection information to determine if any of the aggregated communication sessions comprise any port scans. For example, if all the communication sessions in a given aggregated communication session 58 are successful (i.e., have successful transmissions of the “SYN”, “SYN-ACK” and “ACK” commands), them there is a low likelihood that the given aggregated communication session comprises a port scan. However, if all the connections in the given aggregated communication session comprise failed connections on different ports 40 (as detected using embodiments described supra), then there is a high likelihood that the given aggregated communication session comprises a port scan.

In step 96, processor 48 “cleans” the data in port scan records in order to retain the data that is relevant for analysis. In one embodiment, processor 48 can clean the data by filtering out any of the communication sessions comprising port scans having source ports 70 and protocols 74 that are known to have activity in numerous destination ports 72. For example, based on parameters provided by a systems administrator, processor 48 can filter out any of the port scans whose protocol is NFS and whose source port numbers are either “829”, “2049” or “20048”. In a another embodiment, a given port list 60 may comprise a set of ports 40 that are used by services available on network 28, and processor 48 can filter out any scans of ports 40 in the given port list.

In step 98, processor 48 identifies one or more aggregated port communication sessions 58 that comprise respective port scans 32. In some embodiments, processor 48 can use destination profiles to identify a given port scan, as described in the description referencing FIG. 3 hereinbelow.

In step 100, in response to identifying the port scans in step 88, processor 48 can initiate, for the source node in each identified port scan 32, a first preventive action. In one embodiment, processor 48 can initiate the first preventive action by presenting, on user interface device 54, an alert message indicating that the identified source node is performing suspicious port scans. In another embodiment, processor 48 can initiate the first preventive action by restricting the identified source node from accessing network 28 (e.g., by conveying an instruction to a network switch or a firewall coupling the identified source node to network 28).

In an additional embodiment, processor 48 can initiate the first preventive action by transmitting the identifier of the given source node to an alert management system (not shown) such as a security information and event management (SIEM) system. In a further embodiment, processor 8 can generate the alert by storing the identifier of the given source node to a data structure (not shown) that an alert management system (e.g., a SIEM system) can extract via an API (not shown).

In one variation of the embodiments described hereinabove, processor 48 can identify a user (e.g., via login credentials) of the source node in an identified port scan, and initiate the preventive action with respect to the given user. In another variation of the embodiments described hereinabove, processor 48 can identify, on the source node in an identified port scan, a software process that accessed the ports in the identified port scan, and initiate the preventive action with respect to the software process.

In step 102, processor 48 identifies a given identified port scan that comprises a given source node 64 that scanned an anomalous combination of destination ports 72 on a given destination node 66 during the time period (i.e., a test period). Different embodiments for detecting the anomalous combinations are described hereinbelow in the respective descriptions referencing FIGS. 4-7. The port scan identified in step 90 may also be referred to herein as a suspicious port scan.

Finally in step 104, in response to identifying the anomalous port scans in step 102, processor 48 can initiate a second preventive action for the source nodes in the anomalous port scans, and the method ends. Examples of preventative actions are described supra.

Destination Profiles

In embodiments of the present invention, processor 48 can use destination profiles to detect port scans 32. As described hereinbelow, processor 48 can generate, based on data packets 24 collected during a specified time period, destination profiles for each given destination node 66 that indicates a typical number of ports 40 (i.e., destination ports 72) scanned on the given destination node, and use the destination profiles to detect any subsequently collected port scans that are anomalous.

FIG. 3 is a flow diagram that schematically illustrates a method for computing destination profile scores, and using the computed scores to identify port scans 32, in accordance with an embodiment of the present invention. In step 110, using embodiments described in the description referencing FIG. 2 hereinabove, processor 48 identifies a set of port scans. To identify the set of port scans, processor 48 collects communication sessions 68 and aggregates them into aggregated communication sessions 58. Each aggregated communication session 58 comprises a given port scan 32 having a first given workstation 26 accessing at least one given communication port 40 on a second given destination 26.

Processor 48 collects the communication sessions during multiple time periods that include a training period (also referred to herein as a first time period) and a test period (also referred to herein as a second time period). The test and training periods may have substantially identical (e.g., within 10%) time durations. For example, the test and training periods may comprise 24 hour periods. In some embodiments, the test period is subsequent to the training period. In additional embodiments, the training and the test periods may overlap partially of completely (i.e., the same time period).

In step 112, processor 48 identifies any of the source nodes in the aggregated communication sessions that are “noisy scanners”. In embodiments of the present invention, a given source node 64 can be classified as a noisy scanner if the given source node accesses (i.e., “scans”) at least a first number (e.g., at least 20, at least 25, at least 30, at least 35, or at least 40) of destination ports 72 on at least a second number (e.g., 80, 90, 100, or 110) of destination nodes 66 during the training period. In some embodiments, the second number is greater than the first number. As described hereinbelow, processor 48 can ignore any source node 64 that the system processor classified as a noisy scanner.

In step 114, processor 48 computes, for each pair of a given source node 64 and a given destination node 66 in the aggregated communication sessions, a baseline score (also referred to herein as a baseline level) that indicates a typical number of ports 40 that remaining first source nodes (i.e., excluding the given source node and in some embodiments, any of the source nodes that identified as noisy scanners) accessed on the given destination node during a given sub-period (e.g., one day) in the training period. In some embodiments, processor 48 can use the following formula for each of the source node 66 and destination node 66 pairs (i,j) to compute baseline scores:

$\begin{matrix} {{Baseline}_{i,j} = {\frac{1}{D}*{\sum\limits_{d \in {{baseline}\_{days}}}{\frac{1}{{L_{j}^{d}(i)}}{\sum\limits_{k \in {L_{j}^{d}{(i)}}}^{\;}\left( P_{k,j}^{d} \right)}}}}} & (1) \end{matrix}$ where

-   L_(j) ^(d)(i)—set of the source nodes of the destination node j in     day d (i.e., a given sub-period) excluding {i, noisy_scanners}. -   P_(k,j) ^(d)—a number of distinct destination ports 72 between the     source node k and the destination node j on day d. -   D—a number of baseline days d in the training period.

In operation, processor 48 can compute Equation (1) for a single training period D or for a training period having multiple sub-periods D. In embodiments with a single period D, the training and the test periods may have substantially identical time durations, and in embodiments with multiple periods D, the sub-periods and the test periods may have substantially identical time durations.

In step 116, processor 48 computes, for each pair of a given source node 64 and a given destination node 66 in the second aggregated communication sessions, a destination profile score that can be used to identify, based on the destination ports on the destination nodes accessed by the source nodes during the training and the test periods, any of the source nodes that are suspected of performing port scans 32. For example, processor 48 can compute, for each pair (i,j) identified during the test period, the following destination profile score: Score_(i,j) =P* _(i,j)−Baseline_(i,j)  (2) where P*_(i,j) comprises a number of destination ports 72 that the source node i accessed on the destination node j during the test period. In embodiments of the present invention, a higher destination profile score for a given pair (i,j) indicates that number of ports 40 that a given source node i scanned on a given destination node j during the test period was greater than the ports on the given destination node that the given source node scanned during the training period. A higher Score_(i,j) indicates a higher probability that the source node i is performing a port scan on the destination node j.

Finally, in step 118, processor 48 can identify a given pair of source and destination nodes whose destination profile score exceeds a specified threshold (i.e., a level), thereby indicating suspicious port scans, and the method ends. In one embodiment the threshold may comprise a large score value (e.g., 7, 8, 9 or 10) for the score. In another embodiment the threshold may comprise a low score value (e.g., 4, 5 or 6) and the number of failed connections between the source and destination nodes during the test period is greater than a low failed connection value (e.g., 0, 1 or 2).

Source Profile Generation

In a second anomalous port scan detection embodiment, processor 48 can use source profiles to detect potentially malicious port scans. As described hereinbelow, processor 48 can generate, based on ports scans 24 collected during a specified time period, a source profile for each given source node 64 that indicates nodes whether or not a given source node is either an aggressive scanner or a periodic scanner. In embodiments of the present invention, scans from aggressive and periodic scanners are not considered to be suspicious, and the aggressive and periodic scanners can be whitelisted.

Computer networks such as network 28 typically comprise workstations 28 that can execute processes that perform legitimate port scans or perform legitimate activities that resemble ports scans (i.e. with a different intention). Since these services or activities sometimes originate from the same source node 64, embodiments of the present invention can generate and use source profiles to detect these source nodes in order to whitelist their legitimate port scanning activity.

FIG. 4 is a flow diagram that schematically illustrates a method for computing source profiles, and using the computed source profiles to detect and whitelist any source nodes 64 that are aggressive or periodic scanners, in accordance with an embodiment of the present invention. In step 120, using embodiments described in the description referencing FIG. 2 hereinabove, processor 48 identifies a set of port scans. To identify the set of port scans, processor 48 collects, during a timespan comprising multiple predefined time periods, communication sessions 68 and aggregates them into aggregated communication sessions 58. Each aggregated communication session 58 comprises a given port scan 32 having a first given workstation 26 accessing at least one given communication port 40 on a second given destination 26 during a given time period. The predefined time periods may have substantially identical time durations (e.g., one day).

In step 122, processor 48 computes, for each given source node “i” in the port scans, scanned_dests_average_(i) that indicates an average number of destination nodes 66 whose respective communication ports 40 were accessed by the given source node during any given scan by the given source node. In some embodiments, scanned_dests_average_(i) comprises an average number of the destination nodes that the given source node scanned per time period, omitting time periods where no scans were performed by the given source node.

In step 124, processor 48 computes for each given source node “i” in the port scans, for the given source node i,

${{scan\_ ratio}_{i} = \frac{scan\_ days}{D}},$ which indicates a fraction of the time periods D during which the given source node accessed at least one of the destination nodes in at least one of the port scans carried out by the given source node.

In step 126, processor 48 whitelists, based on the computed scanned_dests_average_(i) averages and scan_ratio_(i) fractions, any of the source nodes that are classified either as aggressive or periodic scanners, as described in the criteria hereinbelow, To whitelist a given source node 64, processor 48 adds the given source node (i.e., the corresponding port number) to whitelist 80.

In embodiments of the present invention, an aggressive scanner can be defined as a given source node 64 that scans a few destination nodes 66 during every time period (e.g., every day). For example, an aggressive scanner might scan a database server and a web server (i.e., two different destination nodes) every hour to check on their respective statuses. In some embodiments, for each given source node 64, processor 48 can first identify scan_days_(i) as a number of days the given source node performed at least one scan, and can classify the given source node as an aggressive scanner if ∀i:scanned_dests_average_(i) exceeds a first low threshold (e.g., 2, 3, 4, 5, 6, 7) and/or scan_ratio_(i) exceeds a first high threshold (e.g., 0.5, 0.6, 0.7, 0.8).

For example, if the first low threshold is 3, the first high threshold is 0.5, and the daily number of destination nodes 66 scanned by a given source node 64 is [3,0,4,4,6,3], then the given source node is an aggressive scanner since scan_days_(i)=5, scanned_dests_average_(i)=4, and scan_ratio_(i)=0.833.

In embodiments of the present invention, a periodic scanner can be defined as a given source node 64 that scans many destinations with less frequency (e.g., once a week). For example, a periodic scanner may scan ports 40 on all the nodes (e.g., workstations 26) on network 28 on a weekly basis to see if there are any changes such as if any new ports 40 are open or if there are any respective vulnerabilities in the nodes. In a manner similar to detecting aggressive scanners, for each given source node 64, processor 48 can first identify scan_days_(i), and can classify the given source node as a periodic scanner if ∀i:scanned_dests_average_(i) exceeds a second high threshold (e.g., 10, 15, 20, 25, 30, 35) and/or scan_ratio_(i), exceeds a second low threshold (e.g., 0.10, 0.15, 0.2, 0.25).

For example, if the second high threshold is 30, the first second low threshold is 0.1, and the daily number of destination nodes 66 scanned by a given source node 64 is [0,0,1314,0,0,0], then the given source node is a periodic scanner since scan_days_(i)=1, scanned_dests_average_(i)=1314, and scan_ratio_(i)=0.14.

In one embodiment, processor 48 can receive an input (e.g., from a system administrator) that specifies the first and second low thresholds and the first and the second high thresholds. In another embodiment, processor 48 can dynamically set these thresholds based on the respective distributions of the computed values (i.e., scanned_dests_average_(i) and scan_ratio_(i)). For example, processor 48 can dynamically set the threshold based on (e.g., a fixed percentage) of outliers in the respective distributions of the computed values.

Returning to the flow diagram, in step 126, processor 48 identifies any of the source nodes in the port scans (i.e., that were identified in step 120) that are not in whitelist 80, and the method ends.

In one embodiment, processor 48 can perform step 128 during any given time period in order to identify a given non-whitelisted source node that performed a port scan during the given time period. In another embodiment, the time periods comprise one or more first time periods followed by a second time period, and processor 48 can perform steps 120-126 on the one or more first time periods, and perform step 128 on the second time period.

Port Profiles

Embodiments described herein can use port profiles to detect potentially malicious port scans. Port profiles indicate which combinations of ports 40 are not likely to be a part of “normal” user activity, but rather part of a network scan. The concept behind port profiles is that there are combinations of ports that are suspicious if they are scanned during a short period of time (e.g., one day). For example, if a legitimate user wants to access a specific network service provided by a given workstation 26 on network 28, the user typically knows what software application is providing the service, and any port(s) 40 the software application is using.

In a first port profile embodiment, the service (also referred to herein as a software category) comprises an operating system. For example, if the user wants to communicate with a given workstation running the Windows™ operating system (produced by Microsoft Corporation, Redmond, Wash.), the user can use port number “3389” which is for the remote desktop protocol (RDP) service. However, if the user tries to communicate with the given workstation via port number “22”, then that may be suspicious since port number “22” is typically used by secure shell (SSH) service, which is a service in the Linux™ operating system and rarely exists in Windows™ operating systems.

In a second port profile embodiment, the service comprises database management systems (DBMS). In operation, a first given workstation 26 communicates with a DBMS application executing on a second given workstation 26 via a given port 40 on the second given workstation that is associated with the DBMS application. In this embodiment, a suspicious port scan may comprise the first given workstation communicating with a large number of ports 40 (i.e., on the second given workstation) that are associated with a corresponding large number of different DBMS applications. This type of activity may be caused by an attacker conducting a service enumeration, which, for example, tries to identify all the available DBMS applications on a specific server.

It is important to note that suspicious port scan activity is different in the two embodiments described supra. In the operating system embodiment, a small number of port scans that cross different operating system port groups may be suspicious. This is because a given workstation 26 typically executes a single operating system. However, in the DBMS embodiment, a suspicious port scan may require a large number of port scans that cross different DBMS port scan groups in order to be labeled as suspicious. This is because a given workstation 26 may execute more than one DBMS application.

In the first port profile embodiment, processor 48 can define a plurality of port lists 60 for a corresponding plurality of operating system 42. Each port list 60 comprises multiple port numbers 40 that are commonly used by a given operating system 42. Therefore, each given port list 60 for a given operating system 42 comprises port number 40 that are typically used by the given operating system, and are either never or rarely used by other operating systems 42. Examples of operating systems 42 that can have respective port lists 60 include, but are not limited to Windows™ (produced by Microsoft Corporation, Redmond, Wash.), Linux™, Android™ (produced by Alphabet Inc., Mountain View, Calif.), macOS™ (also known as OS-X™, produced by Apple Inc., Cupertino Calif.).

For example:

-   -   A first given port list 60 may include the port numbers “135”,         “137” and “139”. These ports 40 are typically used by Windows™         services.     -   A second given port list 60 may include the port numbers “22”,         “23” and “111”. These ports 40 are typically used by Linux™         services.

The rationale for defining the port lists in the first port profile embodiment is that an attacker typically does not know the operating system executing on a given workstation 26 that they are scanning, and one goal of the attacker is to identify operating system 42. Therefore, the attacker may scan a few ports 40 from more than one port list 60 in order to identify the operating system executing on the given workstation.

For example, if a first given list 60 comprises ports used by Windows™, a second given 60 comprises ports used by Linux™ and a third given list 60 comprises ports used by macOS™, then for each source node 66 and destination node 66 pair, processor can compute a tuple (N_Windows, N_Linux, N_macOS) that represent respective counts of the port numbers in the port lists that, during a test period (there is no need for a training period) were scanned on the given destination node by the given source node. In this example:

-   Processor 48 would not flag a tuple like (4,0,0), since the given     destination node is probably running Windows™. -   Processor 48 would flag a tuple like (4,2,3), since the given source     node is trying to access ports 40 that usually used by macOS™ but     are rarely used by Windows™.

In some embodiments, processor 48 can use specified thresholds for the mix of counts in the tuples to identify suspicious port scans 32 that “cross” a plurality of operating systems. In a first example, processor 48 can flag the port scans in a given tuple as suspicious if the given tuple indicates a threshold number (e.g., >3, >4 or >5) of scans of ports 40 that are associated with one of the operating systems, and positive numbers of scans of any the ports associated with the remaining operating systems. In another example, processor 48 can flag the port scans in a given tuple as suspicious if the given tuple indicates respective large numbers (e.g., >3, >4 or >5) of scans of ports 40 that are associated with least 2 different operating systems. In the first example, processor can flag a port scan that results in the tuple (4,1,2) as suspicious, and in the second example, the processor can flag the port scan that results in the tuple (0,4,3) as suspicious.

In additional embodiments, processor 48 can transform the tuples into probabilities that the processor can use to identify suspicious port scans. For example, processor 48 can compute probabilities_tuple=[p₁, p₂, . . . , p_(n)] where

$\begin{matrix} {{p_{i} = \frac{n_{i}}{\Sigma\; n_{j}}},{n_{i} \in {ports\_ tuple}}} & (3) \end{matrix}$

There may be instances where the port values are small and the probabilities are suspected to be inaccurate. In other words, even though a given port 40 was not previously accessed, its probability of being accessed in the future is not zero. In one embodiment, processor 48 can use methods such as confidence interval or Laplace smoothing in order to improve estimation. In another embodiment, processor 48 can compute an entropy of probabilities_tuple for a given tuple, and flag the port scans in the tuple as suspicious (i.e., in that they are accessing a suspicious combination of the ports in more than one of the sets) if the entropy exceeds a specified threshold (e.g., 0.1, 0.2).

In the second port profile embodiment, processor 48 can define a plurality of port lists 60 for a corresponding plurality of software applications 44. Each port list 60 comprises multiple port numbers 40 that are commonly used by a specific family of software applications 44. Therefore, each given port list 60 for a given software application 44 comprises ports that are typically used by the given software application, and are either never or rarely used by other software applications 44. In the second port profile embodiment, examples of families (also known as categories) of software applications 44 include, but are not limited to, database services, email services and remote access services (also known as remote session services).

For example, if the family of software application 44 comprises database servers, then the port list for the database servers may comprise:

-   -   A first given port list 60 comprises one or more port numbers 40         for MySQL™ (e.g., “3306”).     -   A second given port list 60 comprises one or more port numbers         40 for Firebird™ (e.g., “1433”).     -   A third given port list 60 comprises one or more port numbers 40         for PostgreSQL™ (e.g., “5432”).     -   A fourth given port list 60 comprises one or more port numbers         40 for MongoDB™ (e.g., “27017”).     -   A fifth given port list 60 comprises one or more port numbers 40         for Cassandra™ (e.g., “9042”).     -   A sixth given port list 60 comprises one or more port numbers 40         for MemcacheDB™ (e.g., “11211”).     -   A seventh given port list 60 comprises one or more port numbers         40 for Aerospike™ (e.g., “3100”).

Typically a given node (e.g., a given workstation 26 or a server) might execute a small number (e.g., 1-3) different database server engines. Therefore, if processor 48 detects that a given source node 64 is scanning, on a given destination node 66, at least a threshold number (e.g., at least 3, at least 4 or at least 5) of ports 40 from different port lists 60 for database servers, this may indicate that given source node is looking for “any” database server, and therefore does not know which one is executing on the given destination profile. When detecting a large number of ports scanned from different port lists 60 for a given network service, having zero or a few few (e.g., less that 2, less than 3 or less than 4) successful sessions can increase suspiciousness.

In some embodiments, processor 48 can use additional criteria such as a number of detected failed connections correlated to different ports 40. In one example, processor 48 can flag (i.e., as suspicious) a port scan that scans a large number (e.g., at least four or at least five) of ports 40 from different port lists 60 for database servers. In another example, processor 48 can flag a port scan that scans a small number (e.g., at least two or at least three) of ports 40 from different port lists 60 for database servers as suspicious wherein at least one of the port scans has a failed connection (as described supra). Note that these examples are typically for port scans that are performed within a short timeframe (e.g., less than one hour, less than two hours or less than three hours).

In a first embodiment, the threshold may comprise a large number such as at least 5, at least 6 or at least 7. In a second embodiment, the threshold may comprise a small number (e.g., at least 2, at least 3 or at least 4) of ports in different port lists, and at least 1 failed connection on any of the port numbers in any of the port lists (i.e., for the family). The port scans in the first and second embodiments are typically within a short time period (e.g., one, two or three hours).

FIG. 5 is a flow diagram that schematically illustrates a method of using port profiles to detect cross software system port scans, in accordance with an embodiment of the present invention. In step 130, processor 48 defines a plurality of software systems in a specific software category, and in step 132, the system processor defines, for each given software system, a given port list 60 comprising a set of one or more ports 40 that are used exclusively by the given software system. Therefore, each port lists 50 comprise at least first and second disjoint sets of communication ports 40 (i.e., port numbers). The category may comprise operating systems or software applications that provide network services such as database servers or email servers. As described supra, if the family is operating systems, then each port list 60 comprises one or more ports 40 used by an operating system such as Windows™, Linux™ or macOS™. Likewise if the family is DMBS applications, then each port list 60 comprises one or more ports 40 used by a DBMS application such as MySQL™, PostgreSQL™ or Cassandra™.

In step 134, using embodiments described in the description referencing FIG. 2 hereinabove, processor 48 identifies a set of port scans. To identify the set of port scans, processor 48 collects, during a predefined time period (e.g., one hour or one day), communication sessions 68 and aggregates them into aggregated communication sessions 58. Each aggregated communication session 58 comprises a given port scan 32 having a first given workstation 26 accessing at least one given communication port 40 on a second given destination 26.

Finally, in step 136, using embodiments described hereinabove, processor 48 identifies, in the identified port scans (i.e., in step 134), a given source node 64 that accesses at least one of the communication ports in a first port list 60 and at least one of the communication ports in a second port list 60, and the method ends.

Deviation from Independent Model

Embodiments described herein can compute a distribution of port usage in network 28, and use the computed distribution to identify suspicious port scans on the network. For example, during a training period, processor 48 can detect that the port numbers “22” and “3389” are used frequently, but rarely together. During a subsequent test period, if processor 48 detects that a given source node 64 scanned, those two ports 40 on a given destination node 66, then the system processor can generate an alert for the given source node.

FIG. 6 is a flow diagram that schematically illustrates a method of detecting port scans 32 comprising outlier pairs of ports 40, in accordance with an embodiment of the present invention. In step 140, using embodiments described in the description referencing FIG. 2 hereinabove, processor 48 identifies a set of port scans. To identify the set of port scans, processor 48 collects, during a predefined time period, communication sessions 68 and aggregates them into aggregated communication sessions 58. Each aggregated communication session 58 comprises a given port scan 32 having a first given workstation 26 accessing at least one given communication port 40 on a second given destination 26.

In step 142, processor 48 computes, for each given port p scanned during the predefined time period, a probability P_(p) that that a given source node 64 accessed a given port p on a given destination node 66 in any port scan 32 during the predefined time period.

In step 144, processor 48 computes, for each pair of ports p1 and p2, a joint probability JP_(p1,p2) of a connection between a given source node 64 and the ports p1 and p2 on a given destination node 66 in any port scan 32 during the predefined time period.

Upon computing JP_(p1,p2) for each pair of ports 40 that were scanned during the training period, in step 146, processor 48 computes a Port Pair Score (PPS) that the system processor can use to identify pairs of ports p1 and p2 that have the following characteristics:

-   Port p1 is scanned frequently by any given source node 64 during the     predefined time period. -   Port p2 is scanned frequently by any given source node 64 during the     predefined time period. -   A given source node 64 rarely scans both ports p1 and p2 on a given     destination node 66 during the predefined time period.

To compute the Port Pair Score, processor 48 can use the following formula

$\begin{matrix} {{PPS}_{{p1},{p2}} = {\frac{P_{p\; 1}*P_{p\; 2}}{{JP}_{{p\; 1},{p\; 2}}}*\left( {1 - {{P_{p\; 1} - P_{p\; 2}}}} \right)}} & (3) \end{matrix}$

In Equation (3), higher PPS scores indicate a pair of ports 40 that are (each) frequently scanned on the network, but are rarely scanned together on a given destination node 66 by a given source node 64 during the predefined time period. In embodiments of the present invention, the threshold for a high PPS score can be a high value. For example the threshold can be greater than 20, greater than 30 or greater than 40.

Finally, in step 148, processor 48 identifies any of the source nodes that, during the predefined time period, scanned a pair of ports 40 having a high Port Pair Score, and the method ends. In embodiments of the present invention, a scanned a pair of ports 40 having a high Port Pair Score indicates that respective JP_(p1,p2) for the pair of ports p1 and p2 is lower than a threshold dependent upon the respective probabilities P_(p) of ports p1 and p2.

In one embodiment, the predefined time period may comprise multiple sub-periods that may have substantially identical time durations. In this embodiment, processor 48 can perform step 150 during any given sub-period in order to identify a given source node 64 that, during the given sub-period, scanned a pair of ports 40 having a high Port Pair Score. In another embodiment, the sub-periods comprise one or more first sub-periods followed by a second sub-period, and processor 48 can perform steps 140-146 on the one or more first sub-periods, and perform step 148 on the second sub-period.

Scanner Probes

Some scanning tools use a port scanning probe that comprises a given software application 44 loaded on one or more nodes 26 and is configured to scan other nodes 26 on the network, and to report results of a scan to a scanning server (e.g., a given node 26). Scanning probes can be deployed in networks having nodes 26 that the scanning server cannot access directly with all the ports required for the scan (e.g., due to a firewall protecting a subset of the network). In operation, probes can be deployed on numerous network endpoints (i.e., nodes 26) to randomly perform port scans, and then transmit results of the scans back to a given node (i.e., a server). Since scans performed by scanner probes may generate alerts, embodiments of the present invention enable processor 48 to whitelist scans performed by a given scanner probe.

FIG. 7 is a flow diagram that schematically illustrates a method detecting any deployed scanner probes, in accordance with an embodiment of the present invention. In step 150, using embodiments described in the description referencing FIG. 2 hereinabove, processor 48 identifies a set of port scans. To identify the set of port scans, processor 48 collects, during a predefined time period, communication sessions 68 and aggregates them into aggregated communication sessions 58. Each aggregated communication session 58 comprises a given port scan 32 having a first given workstation 26 accessing at least one given communication port 40 on a second given destination 26.

In step 152, processor 48 identifies, in the identified port scans, a group of high traffic ports 40. In embodiments of the present invention, processor 48 can classify a given port 40 as having high traffic if the amount data traffic passing through the given port during the predefined time period exceeds a predefined threshold. Examples of predetermined thresholds include, but are not limited to 200, 400 and 600 bytes. In some embodiments, the given port can be on a given node 26. In other words processor 48 can classify the combination of the given node and the given port as having high traffic.

In operation, processor 48 can use volume 88 and/or rvolume in the communication sessions of the aggregated port scan (i.e., corresponding to a given port scan 32) to determine if the data traffic in a given port scan 32 exceeds the predefined threshold. In some embodiments, processor 48 can classify a given port 40 as having high traffic if the maximum amount of data passing through the given port in any given communication session (i.e., during a given port scan 32) exceeds the predefined threshold.

In step 154, processor 48 generates, for the identified port scans, respective signatures 82 indicative of the communication ports other than the high-traffic ports that were accessed in each of the port scans. In other words, a given signature 82 for a given port scan 32 may comprise a set of the communication ports that were accessed during the given port scan and that were not classified as having high traffic.

In step 156, processor 48 computes a respective frequency of occurrence of each of the signatures over the set of the port scans, and in step 158 the processor assembles whitelist 80 by initializing the whitelist and then adding, to the whitelist, the signatures for which the respective frequency of occurrence is greater than a predefined threshold. In one embodiment, the frequency of occurrence for a given signature 82 may include information such as:

-   A number of occurrences that the given signature appeared in all the     identified port scans. -   A number of sources that performed port scans 32 having identical     signatures 82 (i.e. the set of non-high volume ports) to the given     signature. -   A number of destinations having a set of ports that were scanned     identical to the set of ports in the given signature.

In this embodiment, examples of specific thresholds include, but are not limited to:

-   The number of occurrences>a first threshold such as 8, 10 or 12. -   The number of sources>a second threshold such as 0, 1 or 2. -   The number of sources<a third threshold such as 30, 40 or 50. -   The number of destinations>a fourth threshold such as 0, 1, 2 or 3. -   The number of destinations<a fifth threshold such as 10, 20, 30 or     40     In some embodiments, processor 48 can use a combination of the     thresholds to identify the signatures to add to the whitelist. For     example, a given combination may be:

Number of occurrences>10 AND

Number of sources>1 AND

Number of sources <40 AND

Number of destinations>2 AND

Number of destinations<20.

Finally, in step 160, processor 48 identifies any of the source nodes in the identified port scans having respective signatures not in the whitelist, and the method ends.

In one embodiment, the predefined time period may comprise multiple sub-periods that may have substantially identical time durations. In this embodiment, processor 48 can perform step 160 during any given sub-period in order to identify, in the given sub-period, a identified port scan 32 having respective signatures not in the whitelist. In another embodiment, the sub-periods comprise one or more first sub-periods followed by a second sub-period, and processor 48 can perform steps 150-158 on the one or more first sub-periods, and perform step 160 on the second sub-period.

It will be appreciated that the embodiments described above are cited by way of example, and that the present invention is not limited to what has been particularly shown and described hereinabove. Rather, the scope of the present invention includes both combinations and subcombinations of the various features described hereinabove, as well as variations and modifications thereof which would occur to persons skilled in the art upon reading the foregoing description and which are not disclosed in the prior art. 

The invention claimed is:
 1. A method, comprising: identifying, in data traffic transmitted between multiple nodes that communicate over a network, a set of pairs of source and destination nodes, each pair consisting of a given source node and a given destination node, and one or more communication ports accessed in the data traffic between the source and destination nodes in each pair; identifying one or more noisy scanners, each of the noisy scanners comprising a given source node that accesses at least a specified first number of destination ports on at least a second specified number of destination nodes; computing, for each pair in the set, a respective baseline level that is indicative of a first number of the communication ports that source nodes, other than the given source node in the pair and other than any of the identified noisy scanners, accessed on the given destination node during a first time period; computing, for each pair in the set, a respective test score that is indicative of a difference between a second number of the communication ports that the given source node in the pair accessed on the given destination node during a second time period and the baseline level; and initiating a preventive action with respect to the given source node in any of the pairs for which the test score is greater than a specified level.
 2. The method according to claim 1, wherein identifying the pairs of source and destination nodes comprises collecting, from a probe, data packets transmitted over the network, aggregating the collected data packets into communication sessions between a given source node and a given destination node, and identifying, in each the aggregated communication sessions, the given source node accessing at least one given communication port on the given destination node.
 3. The method according to claim 1, wherein the specified level comprises a first specified level, wherein initiating the preventive action comprises initiating a first preventive action, and comprising identifying any failed connections in the port scans during the second time period, and initiating a second preventive action with respect to the given source node in any of the pairs having at least one of the identified failed connections and for which the test score is greater than a second specified level lower than the first specified level.
 4. The method according to claim 1, wherein the second time period is subsequent to the first time period.
 5. The method according to claim 1, wherein the first and the second time periods have substantially identical time durations.
 6. The method according to claim 1, wherein the first time period comprises multiple sub-periods, and wherein computing the test score for a given pair comprises computing separate baseline levels for each of the sub-periods, computing an average of the separate baseline levels, and subtracting the computed average from the second number of the communication ports.
 7. The method according to claim 6, wherein the second time period and each of the sub-periods have substantially identical time durations.
 8. The method according to claim 1, wherein initiating the preventive action comprises generating an alert for the given source node.
 9. The method according to claim 1, wherein initiating the preventive action comprises restricting access of the given source node to the network.
 10. An apparatus, comprising: a network interface controller coupled to a data network comprising multiple nodes that communicate via the network; and at least one processor, each given processor comprising a general-purpose central processing unit (CPU) or a special-purpose embedded processor, which is programmed in software or firmware: to identify, in data traffic transmitted between the multiple nodes over the network, a set of pairs of source and destination nodes, each pair consisting of a given source node and a given destination node, and one or more communication ports accessed in the data traffic between the source and destination nodes in each pair, to identify one or more noisy scanners, each of the noisy scanners comprising a given source node that accesses at least a specified first number of destination ports on at least a second specified number of destination nodes, to compute, for each pair in the set, a respective baseline level that is indicative of a first number of the communication ports that source nodes, other than the given source node in the pair and other than any of the identified noisy scanners, accessed on the given destination node during a first time period, to compute, for each pair in the set, a respective test score that is indicative of a difference between a second number of the communication ports that the given source node in the pair accessed on the given destination node during a second time period and the baseline level, and to initiate a preventive action with respect to the given source node in any of the pairs for which the test score is greater than a specified level.
 11. The apparatus according to claim 10, wherein a given processor is configured to identify the pairs of source and destination nodes by collecting, from a probe, data packets transmitted over the network, aggregating the collected data packets into communication sessions between a given source node and a given destination node, and identifying, in each the aggregated communication sessions, the given source node accessing at least one given communication port on the given destination node.
 12. The apparatus according to claim 10, wherein the specified level comprises a first specified level, wherein a given processor is configured to initiate the preventive action by initiating a first preventive action, and comprising identifying any failed connections in the port scans during the second time period, and initiating a second preventive action with respect to the given source node in any of the pairs having at least one of the identified failed connections and for which the test score is greater than a second specified level lower than the first specified level.
 13. The apparatus according to claim 10, wherein the second time period is subsequent to the first time period.
 14. The apparatus according to claim 10, wherein the first and the second time periods have substantially identical time durations.
 15. The apparatus according to claim 10, wherein the first time period comprises multiple sub-periods, and wherein a given processor is configured to compute the test score for a given pair by computing separate baseline levels for each of the sub-periods, computing an average of the separate baseline levels, and subtracting the computed average from the second number of the communication ports.
 16. The apparatus according to claim 15, wherein the second time period and each of the sub-periods have substantially identical time durations.
 17. The apparatus according to claim 10 wherein a given processor is configured to initiate the preventive action by generating an alert for the given source node.
 18. The apparatus according to claim 10, a given processor is configured to initiate the preventive action by restricting access of the given source node to the network.
 19. A computer software product, the product comprising a non-transitory computer-readable medium, in which program instructions are stored, which instructions, when read by a computer, cause the computer: to identify, in data traffic transmitted between multiple nodes that communicate over a network, a set of pairs of source and destination nodes, each pair consisting of a given source node and a given destination node, and one or more communication ports accessed in the data traffic between the source and destination nodes in each pair; to identify one or more noisy scanners, each of the noisy scanners comprising a given source node that accesses at least a specified first number of destination ports on at least a second specified number of destination nodes; to compute, for each pair in the set, a respective baseline level that is indicative of a first number of the communication ports that source nodes, other than the given source node in the pair and other than any of the identified noisy scanners, accessed on the given destination node during a first time period; to compute, for each pair in the set, a respective test score that is indicative of a difference between a second number of the communication ports that the given source node in the pair accessed on the given destination node during a second time period and the baseline level; and to initiate a preventive action with respect to the given source node in any of the pairs for which the test score is greater than a specified level.
 20. A method, comprising: identifying, in data traffic transmitted between multiple nodes that communicate over a network, a set of pairs of source and destination nodes, each pair consisting of a given source node and a given destination node, and one or more communication ports accessed in the data traffic between the source and destination nodes in each pair; computing, for each pair in the set during a first time period comprising multiple sub-periods, respective baseline levels that is indicative of a first number of the communication ports that source nodes other than the given source node in the pair accessed on the given destination node during each of the sub-periods; computing, for each of the pairs in the set, an average of the respective baseline levels for the each of the pairs; computing, for each given pair in the set, a respective test score that is indicative of a difference between a second number of the communication ports that the given source node in the pair accessed on the given destination node during a second time period and the computed average baseline level for the given pair; and initiating a preventive action with respect to the given source node in any of the pairs for which the test score is greater than a specified level. 